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Abstract —This paper proposes a new method to provide per¬ 
sonalized tour recommendation for museum visits. It combines an 
optimization of preference criteria of visitors with an automatic 
extraction of artwork importance from museum information 
based on Natural Language Processing using textual energy. This 
project includes researchers from computer and social sciences. 
Some results are obtained with numerical experiments. They 
show that our model clearly improves the satisfaction of the 
visitor who follows the proposed tour. This work foreshadows 
some interesting outcomes and applications about on-demand 
personalized visit of museums in a very near future. 

I. Introduction 

USEUMS are no longer only institutions that acquire, 
store and expose our heritage. Going to a museum is a 
learning activity but also an enjoyment for visitors. With the 
emergence of the Web, curators and cultural mediators decided 
to get involved in collaborative and numerical culture to attract 
a larger public. Today, almost all museums have a website but 
few of them allow the visitors to prepare their visit in the best 
conditions. 

Some art, science and society museums are collaborating 
with research laboratories to develop new technologies that 
improve services in museums in response to the desires of 
existing and potential visitors. 

However, there are still difficulties, epistemological barri¬ 
ers, to study the expectations and the intentions of different 
publics, including online visitors. Knowing why people want 
to come and visit museums could allow automatic systems to 
suggests their tour, save their time and give them the best of 
the knowledge of the exhibited arts. 

Among all possibilities, a recommendation system for per¬ 
sonalized routing is by far one of the best improvements. 
Indeed, some museums exhibit thousands of artworks and it is 
not conceivable for a visitor to admire all of them because he 
might spend time in front of artworks which do not match 
his interests and he might not be able to see other more 
interesting artworks due to tiredness or a lack of time. A few 


museums, as The Louvre, offer a recommendation systerrfjjbut 
they are limited to the selection of a route in a pre-established 
set. Moreover, in this particular case, the personalization is 
restricted to the selection of a theme and the duration of the 
visit in a set of no more than 10 themes and 4 different 
durations. 

It is essential to propose a personalized route for each 
visitor or group of visitors according to their interests while 
taking into account their constraints such as limited schedule, 
physical handicap or a list of artworks to include of the tour. 
This operation may also reduce unuseful moves (avoid round 
trips). But to calculate an optimal tour, we need to assess the 
visitor interest for each artwork by asking his preferences. 

Modeling the preferences with random distributions may not 
reflect reality because curators take care of the scenography 
(therefore the coherence) of each room. So we worked on 
prefered artists (the visitor can select a set of interesting 
artists) and we propose to use the artworks description to 
highlight a kind of intrinsic interest from the point of view of 
the museum. Indeed, the description displayed to the visitor 
should show how significant is the artwork for the museum. 
We valuate each item by analyzing their description (with 
Automatic Text Summarization) and use it as a base value, 
considering that even without any preference, some artworks 
are more interesting than others. 

The Musee de I’Orangerie 

Due to the time needed to extract and check all the data we 
worked on this small museum to test our model. 

The Musee de l’Orangerie (Museum of Orangerie), in 
Paris (France), regroups 144 artworks from 14 artists in 10 
exhibition rooms. The web si t^] of the museum supplies a 
map (shown in Fig. |T|) and indexes information about all the 

1 http://www.louvre.fr/en/parcours 

- http://musee-orangerie.fr 



UPPER FLOOR: Water Lilies 



LOWER FLOOR: Jean Walter - Paul Guillaume Collection 



Fig. 1. Map of the Musee de l’Orangerie 


artworks including the name of the artist, a description of the 
artwork and its date of creation. 


Paper organization 

The remaining of this paper is organized as follows. We 
review the related work in Section [II] In Section [IIIJ we 
present a Natural Language Processing (NLP) based approach 
to compute artwork interest. The Personalized Tour Recom¬ 
mendation Problem is presented in Section [IV] To solve this 
problem, we develop an Integer Linear Programming based 
method to solve the tour optimization problem in Section 
|Vl and define a model to represent the visitor preferences 
in Section [VI] The simulations are conducted and numerical 
results are demonstrated in Section [VTI| Finally, we conclude 


the paper in Section VIII 


II. Related work 

A first model developed in 2010 HI proposes to formulate 
the visitor routing problem as an extension of the open shop 
scheduling problem (in which each visitor group is a job and 
each interesting room is a machine). Each visitor group has to 
pass through all rooms but it is impossible for two groups of 
visitors to be simultaneously in the same room. This restriction 
can lead to non optimal or infeasible solutions if there are more 
visitor groups than rooms in the museum (which is the case 
if we consider each single visitor as a group). 

Relying on the constraint programming model 0, we 
propose to reduce the number of used variables. In 0, they 
generate a route by calculating the smallest number K of 
steps required to cross the museum (to visit all the rooms). 
This model requires that each artwork is represented as K 
variables (one per step). Due to the fact that museums often 
have several thousands of artworks, it leads to a huge number 
of variables. Moreover they use mathematical distributions to 
simulate a visitor profile which does not necessary reflect 


reality (in museums, artworks are often grouped in a room 
because they are related to each other, a configuration that a 
random distribution as they used cannot represent). 

In 2013, some works 0 used the visiting style of visitors 
(the way a visitor go from an artwork to another) but their 
model requires two matrices of size N 2 (where N is the 
number of artworks). The first one indicates the accessibility 
to an artwork from another (if they are in the same room or 
in two rooms directly connected) and the second one contains 
the distance between two artworks. However as the number 
of artworks is always greater than the number of rooms, 
most of the museums are modeled as two sparse matrices 
with duplicated data (in a room, it is often allowed to freely 
move between artworks). This makes the use of constraint 
programming expensive. 

III. Artwork description analysis using Textual 
Energy 

Our idea is to use the description of each artwork as an 
independent measure of their interest. Indeed, two similar 
artworks (same theme, support, artist) will produce the same 
result but may be very different. By analyzing the description 
provided by the museum, we tried to differentiate them. 

Automatic Text Summarization (ATS) techniques by ex¬ 
traction 0, 0, 0 allow to rank a set of textual segments 
(sentences, paragraphs etc.) depending on a measure of sim¬ 
ilarity. Textual Energy algorithm (Enertex) converts a textual 
document into a physical object and use Statistical Physics to 
measure its energy [7[. This energy, to which we should refer 
as Textual Energy, is then computed and apply to summariza¬ 
tion. The physical model of Textual Energy gives rise to a 
single non iterative algorithm of low complexity. Therefore 
Textual Energy allows to redefine sentence ranking on simple 
and efficient matrix operations. The resulting algorithms are 






































much easier to apply to large texts and give better results 
without using any post-processing. 

A. Starting point: Hopfield Model 

Hopfield’s approach J8), |9] was based on magnetic Ising 
model to build a Neural Network (ANN) with pattern learning 
capabilities. The capacities and limitations of this ANN (an 
associative memory), were well established in a theoretical 
framework |(8], (9): the patterns must not be correlated to 
obtain free error recovery, the system saturates quickly and 
only a little fraction of the patterns can be stored correctly. 
Despite these major drawbacks, Hopfield contributed to ANN 
theory by introducing the concept of energy by analogy with 
magnetic systems. A magnetic system is a set of N spins like 
small magnets that can adopt several orientations. The simplest 
model is the dipole one or Ising model where there are only 
two opposite possible orientations: up (f or +1) or down (| or 
0). Ising magnetic model was used in a large variety of systems 
that can be completely described by a set N of binary variables 
(HD with 2 n possible configurations (patterns). The spins are 
equivalent to neurons that can interact following Hebb’s rulc£] 

p 

Ji,j = 'y ' i s p,i x s p,j) (1) 

f*=i 

and are the states of neurons i and j in the pattern //.. 
The summation concerns the P patterns to store. This rule of 
interaction is local, because J l t depends only on the states 
of the connected unities. It has the capacity to store and 
to recover certain number of configurations of the system, 
because the Hebb rule transforms these configurations into 
attractors (minimal local) of the energy function GD: 

1 N N 

E^ = .. ^ ^ ' (Si X -If j X Sj) (2) 

t = 1 3 =1 

The fundamental concept of magnetic energy is a function 
of the system configuration, that is, of the state of activation 
or non-activation of its units. The concept of energy induces 
a type of interaction. If we present a pattern v, every spin 
will undergo a local field: h l = Y2jL i ■J’"' 1 x 1 induced by the 
energy of the others spins. Therefore the total energy of the 
new system made of the new pattern inserted into the previous 
system reflects the interaction between the pattern and the 
initial system. 

We shall focus on theoretical objects that are usually consid¬ 
ered in Statistical Physics. In magnetic system analysis, these 
are energy function distributions HD. Hopfield himself used 
these functions to show that the recovery is convergent. Our 
Enertex system is entirely grounded on them. 

B. Energy as a document similarity measure 

The Vector Space Model (VSM) has also been applied 
to texts since m following a bag of word representation 
of sentences. In this model vectors represent sentences and 

3 Hebb m suggested that synaptic connections change according to the 
correlation between neuronal states. 


a document gives rise to a matrix. We have used VSM to 
represent documents in our model magnetic system: a sentence 
(a row vector) is equivalent to a Ising spin chain and a 
document (a magnetic system) is represented by a matrix of 
P rows x N columns. Therefore, sentences can be studied 
as Ising spin chains. More formally, with a vocabulary of 
N words (terms) in a document, it is possible to represent 
a sentence as a chain of N spins, i = 1, • • • , N. A document 
with P sentences is formed of P chains in the vector space 
S of dimension N. In this paper, the description of each 
artwork is assimilated as a long pseudo-sentence. Therefore, 
a document (the collection of a museum) is constituted of a 
set of P (pseudo-)sentences. 

Documents are preprocessed by removing functional words 
(by using a stop list), normalized and lemmatized m. M- 
This preprocessing reduces considerably the document dimen¬ 
sionality. Let be T = {ii,...,tjv} the set of remaining 
terms after this preprocessing. Once segmented into units, 
usually sentences, the text is represented by a set S = 
{Si, ..., s, L ,..., Sp} where each s M is the bag of words in 
segment /(. As usual in text vector model, we consider the 
matrix S[ PxN ] = ( s p,j)i<p<P,i<j<N of frequency/absence 
associated to H by: 

_ f tfp-j ^ A? £ sentence p 
S ^’ j \ 0 otherwise ( ' 5) 

where is the term frequency of tj in the sentence p. 

We therefore consider the presence of term tj as a spin Sj 
j" with magnitude tf^j (its absence by a j, respectively), and 
a description of each artwork (text segment) by a chain of N 
spins. 

It is common to consider that these vectors are correlated 
according to the shared words. Here the introduction of the 
magnetic model induces moreover indirect interactions. In this 
model sentences that do not share any word could however 
interact because of the magnetic field generated by the other 
sentences of the document that form the global magnetic 
system. 

We have studied the interactions between the terms and the 
sentences using Hebb’s mle and Ising energy respectively. To 
obtain the matrix J of interactions between the N terms, we 
apply Hebb’s rule (equation |T]) in its matrix form: 

J = S T xS (4) 

where , is the number of co-occurrences of terms in 
sentences. The energy function (equation [2} of a (magnetic) 
system S is: 

E = SxJxS T = {Sx S t ) 2 (5) 

Each element E represents the energy between sentences 
p and v. The values in the first matrix diagonal quantify the 
interaction energy between words into a sentence meanwhile 
the other values in the rest of the matrix show the interactions 
between distinct sentences. The sum of absolute values in one 


row gives the total energy of interaction of the corresponding 
sentence p with the document: 

= 5 >„,„| (6) 

V 

We use this energy value to rank sentences (description 
of artwork) by order of decreasing importance. The most 
energetic will be considered as the most important. 

IV. Personalized Tour Recommendation Problem 

The Personalized Tour Recommendation Problem (PTRP) 
can be viewed as an optimization problem and solved by 
optimization techniques. For this purpose, we first model the 
museum topology as a graph and then formulate the studied 
problem as an Integer Linear Programming (ILP) instance. 
Therefore, the optimal personalized tour can be obtained by 
solving the ILP model we propose. 

A. Museum modeling 

A museum is modeled as a 7-tuple G = ( V, A, E , X , P, r) 
where: 

• V is the set of vertices, each vertex is an exhibition room, 
an entrance or an exit of the museum. 

• A is the set of arcs which connect different rooms. There 
is an arc a t j £ A between two vertices i and j, if we 
can go from room i to room j directly without passing 
through other rooms. 

• E is the set of entrances of a museum, which is a subset 
of V, i.e. EcV. 

• A' is the set of exits of a museum, which is also a subset 
of V, i.e. X C V. 

• P is the set of all artworks in the museum. 

• r is a mapping function P —► V. For each artwork p £ P, 
r(p) is the room containing p. 

Some large museums may have several entrances and exits, 
that is why E and X are two subsets of V. We also admit 
that there is always a path from any entrance to an exit. We 
consider directed arcs and not edges because some museums 
may impose a flow direction for several reasons (minimizing 
congestion, pedagogical tour). Note that by definition of A, 
there is no incoming arc to any entrance and there is no any 
outgoing arc from any exit neither. 

Application to the Musee de VOrangerie: The Musee de 
FOrangerie can be represented as the graph presented in Figure 
[2] We can see that there is only one entrance and one exit in 
the museum and they are located at the same place. Therefore, 
we consider the entrance and the exit as two different vertices 
in the graph to facilitate the model. The mapping between 
vertices and rooms is shown in Table 0 

B. Personalized tour problem modeling 

For the sake of satisfying the visitor maximally, a visit tour 
should be proposed according to the visitor’s preferences and 
constraints. 

A personalized tour problem can be defined as a 6-tuple 

(G,I,R,u,t,T M Ax) where: 



Fig. 2. A possible graph for the Musee de l’Orangerie 
TABLE I 

The Musee de l’Orangerie: vertices and rooms 


Vertex 

Room 

E 

Entrance 

X 

Exit 

H 

Hall 

Wi 

Water Lilies (first part) 

W 2 

Water Lilies (second part) 

1 

L’Age d'Or 

2 

Paul Guillaume Room 

3 

Impressionism 

4 

Modern primitives 

5 

Laurencin room 

6 

Modern classicism 

7 

Derain room 

8 

Soutine / Utrillo room 


• G is the museum graph representing the museum topol¬ 
ogy as defined above. 

• I is the set of artworks which have to be included in the 
tour. 

• R is the set of artworks which have to be excluded of 
the tour. 

• it is a mapping u : P —> R + . For each artwork p £ P, 
u(p) denotes the interest of the visitor for the artwork p. 

• t is a mapping t : A U V U P — > R + . For each room, arc 
and artwork, we have a time to spend. It can be the time 
needed to cross a room or an arc. It can also be the time 
to see an artwork. 

• Tmax is the maximum time that a visitor wants to spend 
in the museum. 

A visit tour may be a simple path without any cycles (an 
elementary path) or a sophisticate path including cycles (a 
non-elementary path). 























We define a tour as a sequence of pairs (v,P v ) where 
v £ V and P v C r(v) (note that P v may be 0 because 
we can cross a room without seeing any artwork). A tour 
T = ((vi,P Vi ),{v n , P Vn )) is a solution to the personalized 
tour recommendation problem when: 

1) The vertex v in the first element of T is an element of 
E (the tour starts by an entrance). 

2) The vertex v in the last element of T is an element of 
X (the tour ends by an exit). 

3) All consecutive elements T) and !'■> of T share the same 
vertex or an arc a 7y £ A must exist from the vertex i of 
Ti to the vertex j of X 2 . 

4) The total time required to see all the artworks and pass 
through all the rooms (and ways) is not bigger than 

Tmax- 

Application to the Musee de I’Orangerie: we may have a 
visit tour like the following: 

((f? ) 0),(H,0),<W 1 ,p 1 ),(W2,P5),(Wi,0),(fr,0) J 

(l,p w ),{8,p W3 ),(h®),(H,<b),(X,m 

In this tour, the visitor should cross the receiving hall H 
three times, exhibition room W\ and 1 twice respectively. 
Although we may traverse a room several times, the visitor 
is supposed to visit the room only once. Consider for instance 
the exhibition room W\, we may visit the selected artworks 
when we reach this room for the first time. The second time, 
we would just cross the room to visit another one. 

V. Integer Linear Programming approach to solve 
the Optimal Personalized Tour Recommendation 
Problem 

Before introducing an ILP model to solve the Personalized 
Tour Recommendation Problem, we define several decision 
variables: 

• x p equals 1 if the artwork p £ P is included in the 
proposed tour, 0 otherwise. 

• y a equals 1 if arc a £ A is crossed in the proposed tour, 
0 otherwise. 

• f a denotes the number of rooms crossed when we arrive 
at arc a £ A in the visit walk. 

• z v equals 1 if room v £ L is traversed in the proposed 
tour (no matter whether we visit an artwork of this room 
or not), 0 otherwise. 


Given a personalized tour problem (as defined in section 


IV 1 , the objective function of the Optimal Personalized Tour 


Recommendation Problem (OPTRP) is to maximize the overall 
satisfaction of the proposed visit tour for the visitor: 



max ^2 x p x 

u(p) 

OPTRP 

(7) 
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Constraints ([8]) and (|9| ensure that the visitor should enter a 
museum from a unique entrance and finish the visit by a unique 
exit respectively (this model considers the case of multiple 
entrances and exits). Constraint ( [T()| makes sure that a visitor 
should exit a room v after crossing or visiting it. Constraint 
(111 expresses that a visitor should have crossed at least a room 
before arriving at an arc a, while constraint (12 1 imposes that 
no flow is moving on the arc a, if it is not crossed in the 
visit tour. Constraint m means that if a room v is crossed in 
the tour, then the number of rooms crossed before arriving at 
this room equals the number of rooms crossed after leaving v 
minus one. Otherwise, they should be equal, since the room 
will not appear in the tour. Constraint © imposes that a 
room v should be crossed as long as one input arc or one 
outgoing arc is crossed. Constraint (14 1 ensures that a room 
v should not be crossed if none of the input arc or output 
link is used during the visit. Constraint ( p~6] > indicates that if a 
room v is not crossed, none of its artworks will be proposed 
for visiting. Constraints and ( fl8| ensure that an artwork 
should be included or excluded from the proposed tour if the 
visitor asks for it. The last constraint ( [T9| guarantees that the 
time spent in front of the artworks and the time required to 






pass through rooms (and ways) does not exceed the available 
time for the visitor. 

The ILP model we propose provides a visit tour starting 
from an entrance and terminating at an exit. In 0, authors 
also proposed an ILP model to plan the personalized visit. 
They divided the studied proposed into two sub-problems: first 
determine the number of moves (denoted as K) for a complete 
walk in the museum graph, and then solve the museum routing 
problem while maximizing visitor satisfaction. Since both of 
these sub-problems are NP-Hard, authors of 0 proposed to 
solve both of them by constraint programming. The complex¬ 
ity of their model depends a lot on K , which is generally 
large (at least equals to |V|). To compare the complexity of 
our model with the ILP mode in 0 , the number of variables 
and constraints are listed in Table [TT] 

VI. Visitor preferences modeling 

The interest function u should reflect the satisfaction of the 
visitor for each artwork p £ P. The nearer to his preferences 
is an artwork p, the greater is u(p). 

Representation of artworks and the visitor preferences 

We define C p as the set of all characteristics of an art¬ 
work p and C as the union of all these sets (i.e. C = 
{c|c £ C v Mp £ P}). A characteristic may be the theme, the 
type of support, the date of creation, the name of the artist or 
anything that identify an artwork. 

We can represent any artwork p £ P as a caracteristics 
vector v p = (v pi ,..., v Pn ) in a vector-space of \C\ dimensions. 
Each element v Pi £ M + in the vector is a numerical value 
measuring the relevance of the artwork p to the associated 
characteristic. Additionally we define a vector v in the same 
vector-space as the vector representing the visitor preferences 
(where each element of v measures the interest of the visitor 
for the associated characteristic). 

Measuring the interest for an artwork 

To identify the interest for the visitor to an artwork, we 
compare v and v p with the cosine similarity which calculate 
the angle between two vectors. The formula is the following: 

~\Tl 

similarity(v p ,v ) = — . 1 —— . ' (20) 

Vex >< Vex 

The resulting similarity ranges from 0, meaning that the 
visitor is not interested at all by the artwork, to 1 meaning 
that the artwork exactly matches his preferences. 

In our model, we used u(p) = similarity(v p ,v) where 
v p and v are the vectors of the artwork p and the visitor 
preferences respectively. 

VII. Simulations and numerical results 

We implemented the ILP model described in section |V] by 
using the IBM CPLEX 12 library^ 

The program takes several input parameters: 

4 http://www-01.ibm.com/software/commerce/optimization/cplex-optimizer/ 


The graph modeling the museum as defined in section IV 


• The interest function / to use. This function produces 
interest vectors as defined in section [Vi] 

• The maximum duration that a visitor can spend in the 
museum 

It outputs the proposed tour (as defined in section |IV|). 


A. Intrinsic interest 


As we saw in section III the Enertex algorithm ranks the 
sentences of a document. We used Enertex as the following: 


1) From the website of the museum, we created an XML 
file containing the following information for each art¬ 
work: the title, the artist name, the year and the descrip¬ 
tion of the artwork. 

2) We extracted data from the XML to produce a file where 
each pseudo sentence is a concatenation of title, artist 
and description. 

3) The latter file is used as an input to Enertex with 
the query ’’musee orangerie peinture impressionniste 
postimpressionniste” to drive the balancing process of 
the system. 


It produces a ranking for artworks depending on the infor¬ 
mation displayed by the museum (for each artwork, the result 
is a value ranging from 0 to 1). 

Fig. [3] shows the ranking of the artworks in the Musee de 
1’Orangerie provided by Enertex. 



Artwork's number 

Fig. 3. Ranking by Enertex of the artworks in the Musee de TOrangerie 

As we can see, the resulting ranking is in agreement with the 
information provided by the museum. Indeed the masterpieces 
(according to the website of the museum) represent the most 
important artworks (which have the highest scores). 











TABLE II 

Comparison of ILP Models 


Terms 

OPTRP ILP 

ILP gj 

Variables 

Number of variables 
Constraints 

Number of constraints 

•Tpi Vat fai 

\P\ + 2|A| + |Vj 

3 + 4|A| + |P| + ifVTTVI - 2 (\ E \ + 1*1) 

Xp, Xp^jc, 

(\P\ + \A\)xK+\P\ 

(2)-(8) in O 

|A| x (K - 1) + |P| x (2 K - 1) + K + 3 


B. Interest functions 

Four different interest function were designed to simulate 
the visitor preferences. 

• f\. produces the same vector v - ( 1 ) for each artwork 
and visitor preferences 

• f 2 - produces a vector v p = ( s p ) where s p is the score 
given by Enertex for the artwork p and produces a vector 
v = ( 1 ) as the visitor preferences. 

• / 3 : produces vectors v = (vi, ...,v n ) of size n equals to 
the number of artists. Each artwork is represented as a 
vector where u* = 1 if the artwork is created by the artist 
i, 0 otherwise. The visitor preferences are represented as 
a vector where Vi = 1 if the visitor is interested by the 
artist i, 0 otherwise. 

• / 4 : produces vectors v = Vf 2 1| 10 x Vf 3 where Vf 2 and v / 3 
are the vectors produces by f 2 and / 3 respectively. 

The first function defines the baseline: the visitor has no in¬ 
terest at all. The second uses the ranking provided by Enertex: 
the visitor wants to discover the most important artworks of 
the museum. The third uses the visitor preferences (a set of 
interesting artists). The last combines both visitor preferences 
and museum point of view (we multiply by 10 because we 
want to give more importance to the visitor preferences than 
to the museum point of view). 

C. Evaluation 

To evaluate the output tour, we measure the relevance 
percentage defined as : 


100 * EpgT f(p) 
J2 p gp f(p ) 


( 21 ) 


where T is the set of artworks proposed in the tour (i.e. 
TCP ) and / the interest function used (as saw above). 
The relevance percentage rp denotes a satisfaction rate of the 
visitor. 


D. Results 

For each function /, we ran the program with different time 
limits from 30 to 330 minutes (the time required to visit the 
entire collection) by steps of 15 minutes. For f 2 and / 3 , we 
randomly pre-generated 5,000 combinations of 2, 3, 4 and 5 
artists (i.e. the same combinations are used with f 2 and / 3 ) 
and calculated the arithmetic mean relevance percentage for 
each duration. 

Figure [4] shows the evolution of rp for each function /. 
As expected, the first interest function /1 produces a linear 
evolution of the relevance percentage (given that all artworks 


have the same interest, the tour includes the greatest number of 
artworks). With Enertex we are able to propose efficent tours to 
visitors who want to discover the museum (without particular 
preferences). The combination of both visitor preferences and 
intrinsic preferences produces the best results up to 49% of 
relevance improvement. It also appears that after 150 minutes, 
the improvement is less significant, we could assume that, 
from the visitor point of view, the optimal tour duration is 
about 2 hours and a half. 

VIII. Conclusion and future works 

This research tackles the problem of optimizing museum 
visits according to visitors preference and artwork importance. 
As a first milestone for next works taking into account the 
individual behavior in museum visits, it sets an original model 
combining computational optimisation and automatic learning 
via artificial intelligence. We first drew the optimization frame¬ 
work based on graph theory to depict the spatial organization 
of the museum (including rooms and paths), that requires an 
Integer Linear Programming to maximize the visitor overall 
satisfaction and to generate an optimal path, that is to say 
a series of rooms and artworks to be seen by the visitor. In 
complement, we compute an artwork description analysis by a 
natural language processing based on textual energy (using an 
algorithm called Enertex). This leads to ranking the different 
artworks according to the descriptions given by the museum, 
related to their artistic importance. Associating those two 
complementary approaches, we are then able to design optimal 
paths for visitors according to different interest functions based 
on artwork objective values assigned by museums. 

Future works concern more subjective behavior of visitors 
depending on their profiles and leisure practices. Indeed, the 
project aims at finding relevant recommendations for optimal 
visit tours that rise a better fitness between the visitor wishes 
and the museum artistic supply. We can think about using nat¬ 
ural language processing to generate the set of characteristics 
for all the artworks in a museum and calculate better interest 
vectors but also produce a summary of the proposed tour. 

This information may advantageously be used by existing 
and potential visitors to refine the way they get involved in 
their cultural pratices. Indeed, it is admitted that the museum 
connoiseurs use to develop a critical mind about new services 
in a numerical society. Thence, aware visitors become able to 
appreciate the personalized routing recommendation system 
provided by their prefered museums. 
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Fig. 4. Evolution of the relevance percentage 
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